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' Abstract 

We investigate the role of nondeterminism in Winfree's abstract Tile Assembly Model (aTAM), 
^ ' which was conceived to model artificial molecular self-assembling systems constructed from 

O . DNA. Designing tile systems that assemble shapes, due to the algorithmic richness of the aTAM, 

is a form of sophisticated "molecular programming" . Of particular practical importance is to 
find tile systems that minimize resources such as the number of distinct tile types, each of which 
' corresponds to a set of DNA strands that must be custom-synthesized in actual molecular im- 

plementations of the aTAM. We seek to identify to what extent the use of nondeterminism in 
tile systems affects the resources required by such molecular shape-building algorithms. 

By nondeterminism we do not mean a magical ability such as that possessed by a nonde- 
terministic algorithm to search an exponential-size space in polynomial time. Rather, we study 
t/3 , realistically implementable systems that retain a different sense of determinism in that they are 
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guaranteed to produce a unique shape but are nondeterministic in that they do not guarantee 
which tile types will be placed where within the shape. A sensible analogy is a nondeterministic 
' algorithm that outputs the same value on all computation paths for a given input. Such an al- 

gorithm can always be replaced by an equivalent deterministic algorithm with the same running 
time, memory usage, and program length. It is then intuitively reasonable to conjecture that a 
QQ , similar equivalence should hold between deterministic tile systems and those nondeterministic 

PsJ ' tile systems that always "output" the same shape. 

, This intuition is wrong. We first show a "molecular computability theoretic" result: there is 

' an infinite shape S that is uniquely assembled by a tile system but not by any deterministic tile 

, system. We then show an analogous phenomenon - using a different technique - in the finitary 

"molecular complexity theoretic" case: there is a finite shape S that is uniquely assembled by a 
tile system with c tile types, but every deterministic tile system that uniquely assembles S has 
more than c tile types. In fact we extend the technique to derive a stronger (classical complexity 
theoretic) result, showing that the problem of finding the minimum number of tile types that 
' uniquely assemble a given finite shape is Zj-complete. In contrast, the problem of finding the 

minimum number of deterministic tile types that uniquely assemble a shape was shown to be 
NP-completc by Adleman, Cheng, Gocl, Huang, Kcmpc, Moisset de Espanes, and Rothcmund 
{Combinatorial Optimization Problems in Self- Assembly, STOC 2002). 

The conclusion is that nondeterminism confers extra power to assemble a shape from a small 
tile system, but unless the polynomial hierarchy collapses, it is computationally more difficult 
to exploit this power by finding the size of the smallest tile system, compared to finding the size 
of the smallest deterministic tile system. 
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1 Introduction 



Tile self-assembly is an algoritlimically rich model of "programmable crystal growth" . It is possible 
to design molecules (square-like "tiles") with specific binding sites so that, even subject to the 
chaotic nature of molecules floating randomly in a well-mixed chemical soup, they are guaranteed 
to bind so as to deterministically form a single target shape. This is despite the number of different 
types of tiles possibly being much smaller than the size of the shape and therefore having only 
"local information" to guide their attachment. The ability to control nanoscale structures and 
machines to atomic-level precision will rely crucially on sophisticated self-assembling systems that 
automatically control their own behavior where no top-down externally controlled device could fit. 

A practical implementation of self-assembling molecular tiles was proved experimentally fea- 
sible in 1982 by Seeman [38] using DNA complexes formed from artificially synthesized strands. 
Experimental advances have delivered increasingly reliable assembly of algorithmic DNA tiles with 
error rates of 10% per tile in 2004 [36], 1.4% per tile in 2007 [17], and 0.13% per tile in 2009 [8]. 
Erik Winfree [44] introduced the abstract Tile Assembly Model (aTAM) - based on a constructive 
version of Wang tiling [42,43] - as a simplified mathematical model of self-assembling DNA tiles. 
Winfree demonstrated the computational universality of the aTAM by showing how to simulate 
an arbitrary cellular automaton with a tile assembly system. Building on these connections to 
computability, Rothemund and Winfree [35] investigated a self-assembly resource bound known 
as tile complexity, the minimum number of tile types needed to assemble a shape. They showed 
that for most n, the problem of assembling an n x n square has tile complexity O ^ ^J"^^^ ^ , and 

Adleman, Cheng, Goel, and Huang [4] exhibited a construction showing that this lower bound is 
asymptotically tight. Under natural generalizations of the model [2,6,9-13,19,20,29,39,41], tile 
complexity can be reduced for tasks such as square-building and assembly of more general shapes. 

There are different interpretations of "nondeterminism" in the aTAM. We say a tile system 
is directed (a.k.a. deterministic) if it is guaranteed to form one unique final assembly, where an 
assembly is defined not only by which positions are eventually occupied by a tile, but also by which 
tile type is placed at each position. We say a tile system strictly (a.k.a. uniquely) self-assembles 
a shape if all of its final assemblies are guaranteed to have that shape. A natural analogy may 
be made between a non-directed tile system that strictly self-assembles some shape and a nonde- 
terministic Turing machine A'" that always produces the same output on a given input, regardless 
of the nondeterministic choices made during computation. There is always a deterministic Turing 
machine M computing the same function as N and using no more "resources", according to any 
common resource bound such as time complexity, space complexity, or program length. Therefore 
we regard such a restricted class of nondeterministic Turing machines as no more "powerful" than 
deterministic Turing machines. 

Based on this analogy, it might seem that strict self-assembly, while allowing one form of 
nondeterminism (which tile goes where), so strongly requires another form of determinism (which 
positions have a tile) that extra power cannot be gained by allowing the tile systems to be non- 
directed. More precisely, it is natural to conjecture that every infinite shape that is strictly self- 
assembled by some tile system is also strictly self-assembled by some directed tile system. In the 
finitary case, every finite shape is assembled by a directed tile system (possibly using as many tile 
types as there are points in the shape), so to make the idea non-trivial we might conjecture that the 
tile complexity of a finite shape is independent of whether we consider all tile systems or only those 
that are directed. Such conjectures are appealing because the algorithmic design and verification of 
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tile systems [39] as well as lower bounds and impossibility proofs [6, 15,30] often rely on reasoning 
about directed tile systems, which are "better behaved" in many senses than arbitrary tile systems, 
even those that strictly self-assemble a shape. It would be helpful to begin such arguments with 
the phrase, "Assume without loss of generality that the tile system is directed." 

However, these conjectures are false. We show that there is an infinite shape S that is strictly 
self-assembled by a tile system but not by any directed tile system. Therefore, in a "molecu- 
lar computability theoretic" sense, nondeterminism allows certain shapes to be algorithmically 
self-assembled that are totally "unassemblable" (to borrow Adleman's tongue-twisting analog of 
"uncomputable" [3]) under the constraint of determinism. We then show an analogous phenomenon 
in the finitary case: there is a finite shape S that is strictly self-assembled by a tile system with c 
tile types, but every directed tile system that strictly self-assembles S has more than c tile types. In 
fact to strictly self-assemble the shape in a directed tile system requires more than ~ |c tile types. 
It is open to improve the constant | to be larger or to prove a super-linear gap between the com- 
plexity measures; the issue is discussed in more detail in Section 5. This establishes a "molecular 
complexity theoretic" analog of the first result. We then derive a stronger result, showing that the 
problem of finding the minimum number of tile types that strictly self-assemble a given finite shape 
is complete for the complexity class = NP'^'^. In contrast, the problem of finding the minimum 
number of directed tile types that strictly self-assemble a shape was shown to be NP-complete by 
Adleman, Cheng, Goel, Huang, Kempe, Moisset de Espanes, and Rothemund [5]. 

Based on these results, we conclude that nondeterminism confers extra power to assemble a 
shape from a small tile system, but unless the polynomial hierarchy collapses, it is computationally 
more difficult^ to exploit this power by finding the size of the smallest tile system, compared to 
finding the size of the smallest directed tile system. 

One might argue that this difference between nondeterministic (but "output-deterministic") 
Turing machines and non-directed (but strict) tile systems is not surprising, since there is a "mono- 
tone" aspect to tile assembly in the sense that space used to place a tile cannot be reused, whereas 
a tape cell used to store information by a Turing machine can be reused to store different informa- 
tion later. It is sometimes said that the difference between space and time is that "you can reuse 
space but you cannot reuse time." However, the "computation" carried out by tile systems does 
not distinguish well between space and time. For instance, the standard simulation of a Turing 
machine by a tile system (see [35]) assembles a structure encoding the entire space-time config- 
uration history of the Turing machine. Even with negative glue strengths that are able to force 
detachments to occur, the volume requirements of such a simulation must be proportional to t-s for 
a Turing machine using time t and space s [14], essentially forcing the solution to contain multiple 
assemblies that collectively encode the entire computation history. Tile systems therefore cannot 
reuse space (tiles), which is the fundamental effect of their monotonicity on their computational 
abilities. From this perspective, a Turing machine cannot reuse time any better than any other 
computational system (barring the use of closed timelike curves [1]), yet in contrast to tile systems, 
a nondeterministic but "output-deterministic" Turing machine remains no more powerful, even in 
the sense of time complexity, than a deterministic Turing machine. 

^ "More difficult" in the sense of nondeterministic time complexity, although it is conceivable that both problems 
have the same deterministic time complexity. 
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2 Abstract Tile Assembly Model 



This section gives a terse definition of tfie abstract Tile Assembly Model (aTAM, [44]). This is not 
a tutorial; for readers unfamiliar with the aTAM, [35] gives an excellent introduction to the model. 

Fix an alphabet S. S* is the set of finite strings over S. Given a discrete object O, (O) denotes 
a standard encoding of O as an element of S*. Z, Z"*", and N denote the set of integers, positive 
integers, and nonnegative integers, respectively. For a set 'Pi-A) denotes the power set of A. 
Given A (17?, the full grid graph of A is the undirected graph = {V, E), where V = A, and for 
all u,v £ V, {u,v} G E -4=^ ||n — 'i;||2 = 1; i.e., iff u and v are adjacent on the integer Cartesian 
plane. A shape is a set S* C such that is connected. A shape T is a tree if is acyclic. 

A tile type is a tuple t G (S* x N)^; i.e., a unit square with four sides listed in some standardized 
order, each side having a glue g £ T,* xN consisting of a finite string label and nonnegative integer 
strength. We assume a finite set T of tile types, but an infinite number of copies of each tile 
type, each copy referred to as a tile. An assembly is a nonempty connected arrangement of tiles 
on the integer lattice Z^, i.e., a partial function a : 1? T such that G^on^Q, is connected and 
dom a 7^ 0. The shape Sa C Z^ of a is dom a. Two adjacent tiles in an assembly interact if 
the glues on their abutting sides are equal (in both label and strength) and have positive strength. 
Each assembly a induces a binding graph GJ^, a grid graph whose vertices are positions occupied 
by tiles, with an edge between two vertices if the tiles at those vertices interact.^ Given r € Z"*", a 
is T-stable if every cut of G^ has weight at least r, where the weight of an edge is the strength of 
the glue it represents. That is, a is r-stable if at least energy r is required to separate a into two 
parts. When r is clear from context, we say a is stable. Given two assemblies a,/3 : Z^ --^ T, we 
say a is a subassembly of /5, and we write a C /?, if C Sj3 and, for all points p € Sa, Oi{p) = /3{p). 

A tile assembly system (TAS) is a triple T = {T,a,T), where T is a finite set of tile types, 
a : 1? T is the finite, r-stable seed assembly, and r € Z"*" is the temperature. Given two 
r-stable assemblies a,fi:l? --^ T, we write a -^J (3 if a Q (3 and |S'^\S'q,| = 1. In this case we say 
a T-produces j3 in one step.^ If a -^J /3, Sp\Sa = {p}, and t = /3(p), we write fi = a + {p ^ t). 
The T -frontier of a is the set d^a = U^_j.r^ Sfs \ Sa, the set of empty locations at which a tile 
could stably attach to a. 

A sequence of A; € Z'^" U {oo} assemblies oq, ai, . . . is a T -assembly sequence if, for all 1 < i < fc, 
aj_i —>l ai. We write a /3, and we say a T-produces /3 (in or more steps) if there is a T- 
assembly sequence ao, ai, . . . of length k = \Sp\Sa\ + ^ such that 1) a = a^, 2) Sp = IJo<i<A: ^<^i-> 
and 3) for all < i < A;, a,/ E /3. If /c is finite then it is routine to verify that /3 = a^-i.^ We say a 
is T-producible if cr ^''^ q, and we write A[T] to denote the set of T-producible assemblies. The 
relation is a partial order on A[T] [23,34]. A T-assembly sequence ao,ai,. . . is fair if, for all 
i and all p € d'^Ui, there exists j such that otj{p) is defined; i.e., no frontier location is "starved". 

An assembly a is T -terminal if a is r-stable and d'^a = 0. We write w4n[7^ ^ -^[T^ to 
denote the set of T-producible, 7~-terminal assemblies. A TAS T is directed (a.k.a., deterministic, 

^For = {Vs^,EsJ and = {V^,E^), Gl is a spanning subgraph of Gg^ : Va = Vs^ and C Es^ ■ 
^Intuitively a -^J /? means that a can grow into j3 by the addition of a single tile; the fact that we require both a 
and /? to be r-stable implies in particular that the new tile is able to bind to a with strength at least r. It is easy to 
check that had we instead required only a to be r-stable, and required that the cut of /3 separating a from the new 
tile has strength at least r, then this implies that /3 is also r-stable. 

*If we had defined the relation — based on only finite assembly sequences, then — would be simply the reflexive, 
transitive closure (— J-i")* of ^-i"- But this would mean that no infinite assembly could be produced from a finite 
assembly, even though there is a well-defined, unique "limit assembly" of every infinite assembly sequence. 
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confluent) if the poset (^[7^,^' ) is directed; i.e., if for each a, /3 G ^[7^, there exists 7 € A[T] 
such that a -^'^ 7 and /3 -^'^ 7.^ We say that a TAS T strictly (a.k.a. uniquely) self- assembles 
a shape S* C if, for ah a G >ln[7^, Sa = S; i.e., if every terminal assembly produced by T 
has shape S. If T strictly self-assembles some shape S, we say that T is strict. Note that the 
implication "T is directed =^ T is strict" holds, but the converse does not hold. 

In this paper we will always use singly-seeded temperature- 2 TAS's, those with 15^1 = 1 and 
r = 2; hence we will use the term seed tile for a as well, and for the remainder of this paper we 
use the term TAS to mean singly-seeded temperature-2 TAS. When T is clear from context, we 
may omit T from the notation above and instead write — >i, — >, da, frontier, assembly sequence, 
produces, producible, and terminal. Since the behavior of a TAS T = (T, a, 2) is unchanged if every 
glue with strength greater than 2 is changed to have strength exactly 2, we assume henceforth 
that all glue strengths are 0, 1, or 2, and use the terms null glue, single glue, and double glue, 
respectively, to refer to these three cases.® We also assume without loss of generality that every 
single glue or double glue occurring in some tile type in some direction also occurs in some tile type 
in the opposite direction, i.e., there are no "effectively null" single or double glues. ^ 

3 Assembly of Infinite Shapes 

In this section we study the power of nondeterminism in assembling infinite shapes. The following 
theorem is the main result of Section 3. 

Theorem 3.1. There is a shape S d 1? such that some TAS strictly self-assembles S, but no 
directed TAS strictly self- assembles S. 

Proof. Let L C N be a language that is computably enumerable but not decidable, and let M be 
a Turing machine such that L = L[M). Let S be the shape that is strictly self-assembled by the 
TAS described below, when M is encoded into the TAS as described. 

A portion of the shape S is shown in Figure 1. The TAS that strictly self-assembles S is based on 
the main construction of Lathrop, Lutz, Patitz, and Summers [22]. In that paper, the authors show 
that for each Turing machine M , an encoding of the language L{M) C N accepted by M "weakly 
self-assembles" on the x-axis. More precisely, for a "reasonably simple" function / : N — )• N, a 
special tile type is placed at position (/(n),0) if and only if n € L{M). The n^^ "ray" in Figure 
1 begins growth just before (/(ri),0), and grows independently of the other rays, controlling an 
adjacent simulation of M{n) in parallel with all the other rays. The slope of each ray is just a bit 
smaller than the previous, with the slope approaching 2 as n — )• 00. The simulation executes one 
transition of M on input n every ~ 2" rows of the ray. Since M can use no more than k tape 
cells after k transitions, this slowed simulation ensures that each ray has enough space to allow a 
potentially unbounded simulation of M on each n, without "crashing" into the next adjacent ray, 
even in the worst case that M moves its tape head right on every transition. 

What is needed from this construction for our purpose is: 

^The following two convenient characterizations of "directed" are routine to verify. T is directed if and only if 
|^n[7^| = 1- T is not directed if and only if there exist a,/3 £ A\T\ and p £ Sa C\ Sp such that 7^ Pip)- 

®We use null bond, single bond, and double bond similarly to refer to the interaction of two tiles. 

^Thus the existence of a tile with a double glue facing empty space implies that the empty space is part of the 
frontier. Many of our arguments use the contrapositive that if a shape S is strictly self-assembled by a tile system 
and a side of a tile faces a point p ^ S, then the tile cannot have a double glue on that side. 
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Figure 1: A portion of an infinite shape S that strictly self-assembles, but not by any directed TAS. 
The n*^ ray simulates a Turing machine M on input n, and a vertical line is present under that 
ray if and only if M accepts n. Y and N are points at these positions representing "yes" and "no" 
instances of L{M), respectively, and elements of Y are the points where nondeterminism is forced 
to occur in any TAS that strictly self-assembles S. 



1. / is computable. 

2. The simulation of M{n), carried out adjacent to the n^^ ray, sends a "signal" crawling down 
the right side of the simulation if and only if M accepts n, placing a special tile just above 
the "planter" (the group of tiles growing below each of the rays). 

We modify the signal so that, rather than growing all the way to the planter, for input n, the signal 
grows to distance n north of the planter and then grows a width-1 vertical line n positions down to 
the planter, using the same tile type with equal north and south double glues to "crash" into the 
planter. To ensure that the downward-growing vertical lines do not obstruct the operation of the 
planter, the planter is modified so that it is guaranteed to grow horizontally a sufficient number of 
tiles before laying out the input for the M, so as to guarantee that there is something present for 
a "controlled crash." The space for the downward-growing line of length n to drop after the input 
is accepted is created by having the Turing machine simulations begin not immediately above the 

^ [22] defines the roughly quadratic function f{n) = ("+^) + (n + 1) [logn\ + 6n - ^ Our version of 

this function will grow just a bit faster, to make room for a vertical line to form between two adjacent rays without 
"touching" the rest of the shape except at the endpoints of the line, but retains computability. 
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planter, but at height n on input n. This is why the n ray grows straight up for n rows before 
beginning its sloped growth. Under every simulation, a "notch" tile is placed above the planter 
using a double glue, which is horizontally lined up with where the vertical line will grow if M 
accepts. The actions of the ray, planter and Turing machine simulation are otherwise similar to 
the mechanisms used in [22]. We note that this particular TAS is not directed since the "notch" 
tiles compete nondeterministically with the vertical line tiles at positions where M accepts. 

It remains to show that no directed TAS strictly self-assembles S. Intuitively, we show that 
at points of the form (/(n),0), any directed TAS must place tiles that "know" whether there will 
eventually be a vertical line above the point, implying the ability to decide L since vertical lines 
appear above exactly those positions (/(n), 0) such that n £ L. Assume for the sake of contradiction 
that there is a directed TAS T = (T, a, 2) that strictly self-assembles S, and let a G An [T] be its 
unique producible, terminal assembly. Since the heights of the vertical "bases" of each ray below the 
sloped portion are strictly increasing, there is some no € N such that, for all n > hq, the distance 
from {f{n), 1) to the ray above it is at least |T| + 1. Let Y = { (/(n),0) | n G L and n > uq } he 
the bottommost points of the vertical lines adjacent to rays corresponding to (sufficiently large) 
"yes" instances of L, and let iV = { (/(n),0) \ n ^ L and n > no } represent the positions of the 
"notches" corresponding to (sufficiently large) "no" instances. Y and N are shown in Figure 1. Let 
Ty = ct{Y) and Tjy = a{N) be the set of tile types that appear at "yes" and "no" instance points, 
respectively. Since S has empty space immediately north of positions in A^, no tile type in T^r has 
a north double glue. 

We claim that Ty n Tjy = 0- For the sake of contradiction, suppose otherwise, let t € Ty n Tjy, 
and let p € y be a point where a{p) = t. Since t G Tjv, t has no north double glue, so the vertical 
line above p must grow downward using north and south double glues. Let q = p + (0,1) be the 
point just above p. By our choice of no, the vertical line must repeat a tile type before reaching the 
point q, so all tile types in the repetition period have a north and a south double glue, including 
the tile type t' = a{q). Let t" be the tile type appearing beneath t' after the previous occurrence 
of t' in the vertical line. Since t" has a north double glue, t" Tjy, so t" ^ t. Because t binds to 
the rest of a only through its south double glue, there can be no precedence relationship enforcing 
that p must contain a tile before q (or any other point) receives a tile. In other words, there exists 
a producible assembly (3 £ A[T] such that /3(g) = t' and /3{p) is undefined. This implies that t" 
can bind to /? at position p to create /?' = /3 + (p i-^- t"), contradicting the directedness of T since 
13', a € A[T] but /3'{p) = t" ^ t = a{p). This verifies the claim that Ty n Ta? = 0. 

For all n € N, let p„ = (/(n),0). Since Ty n T/v = 0, for all n > no, n £ L <^=^ pn G 
Y a{pn) € Ty, and n ^ L <^=^ pn £ N a(pn) S T^r. Using this fact, we describe an 

algorithm to decide L, contradicting its undecidability and completing the proof. On input n € N, 
if n < no, use a constant lookup table to decide n. Otherwise, compute pn = (/(n),0). Simulate 
the assembly of T with a fair assembly sequence, maintaining a first-in, first-out queue of frontier 
locations to enforce fairness, until a tile is placed at position p„. Since this assembly sequence is 
fair, the simulation will eventually place a tile type a{pn) at p„, and a(p„)'s membership in Ty or 
Tat will indicate whether to accept or reject n. □ 

We have implemented the tile assembly system that strictly self-assembles S: 
http : //www. dna. caltech.edu/~ddoty/pnsa/ 

It can be simulated using Matthew Patitz's ISU TAS simulator [33] available here: 
http : //www . cs . iastate . edu/~lnsa/ software . html 
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The purpose of the implementation is not to quantitatively analyze the construction, since we 
make no quantitative claims about either the shape being assembled nor the TAS that strictly 
self-assembles the shape. Furthermore, the bulk of the intellectual effort in the proof of Theorem 
3.1 is proving the negative result that no directed TAS strictly self-assembles the shape, which is 
something that cannot be established through a simulation. We provide the simulation primar- 
ily to help the interested reader understand the details of the construction and help to convince 
oneself that the shape really can be strictly self-assembled and to directly observe how the TAS 
accomplishes this task. 



4 Assembly of Finite Shapes 

In this section we study the power of nondeterminism in assembling finite shapes. We first show 
that a finitary analog of Theorem 3.1 holds, by showing that the tile complexity of some shapes 
can be reduced using nondeterminism. The ideas in this construction will be useful in proving the 
main theorem of this section, which shows that the minimum tile set problem is 5l2-complete. 

Recall that all of the TAS's we study are assumed singly-seeded. Let 5 C be a shape. The 
(temperature- 2) tile complexity of S is 

C^^{S) = min { \T\ \ T = {T, a, 2) is a TAS and T strictly self-assembles S } , 

with the convention min0 = oo. The (temperature- 2) directed tile complexity of S is 

(-idtc^^-j _ j^-j^ I I ji| I Y = {T,a,2) is & directed TAS and T strictly self-assembles S } . 

We are interested in the problems, given a finite shape, what is its tile complexity, and what is its 
directed tile complexity? We define two decision problems that are equivalent to these optimization 
problems. Let J^S C 'P{1?) denote the set of all finite shapes. The minimum tile set problem is 

MinTileSet = {{S,c) I 5 G 7-5, c e Z+, and C*"(5) < c } , 

and the minimum directed tile set problem is 

MinDirectedTileSet = { (5, c) 5 e TS, c G Z+, and &^^{S) < c } . 

Adleman, Cheng, Goel, Huang, Kempe, Moisset de Espanes, and Rothemund [5] showed that the 
problem MinDirectedTileSet is NP-complete. In Section 4.2 we show that MinTileSet is 
Z2 -complete, where = NP'^'^. See [7] for a discussion of these complexity classes. 



4.1 A Finite Shape for which Nondeterminism Reduces Tile Complexity 

Although the main result of Section 4, Theorem 4.3, together with the (widely-believed) assumption 
that NP / Z2 and the fact proven in [5] that MinDirectedTileSet e NP, implies Theorem 4.2 
of this subsection, we prove Theorem 4.2 explicitly in order to illustrate some of the reasoning used 
in the proof of Theorem 4.3. 

Given a shape S (possibly a subshape of a larger shape we wish to self-assemble), we say some 
tile types hard-code S to mean that there are |5| unique tile types, each one specific to a position 
in S, using double glues between tile types of all adjacent positions in S. 
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Given a shape S with a subshape S' C S, we say S' is an isolated subshape of S if there is a 
point p (z S' such that every path from a point in S' to a point in S\S' includes p. In this case, we 
say p is the rooi of the subshape. If S' is a tree, we say it is an isolated subtree of S. We say that 
an isolated subshape S' of S is singly- connected if there is precisely one point in S \ 5' adjacent to 
the root of S'. 

Lemma 4.1. Let S be a shape with at least one cycle and St Q S be a singly- connected isolated 
subtree of S with root r G St- Any TAS that strictly self-assembles S places at least C^'^{St) unique 
tile types in St- 

Proof- Let T be a TAS that strictly self-assembles S, and let a G ^n[7^5 so that Sa = S. 

To begin with, we claim that a(r), the tile type on the root r, does not appear anywhere else 
in St- For the sake of contradiction, suppose there were a position p £ St\ {r} with a{p) = a{r). 
Since St is assumed to be a tree, there is a unique path p = {po, . . . ,Pm) S between r and 

p, such that po = r and Pm = P- Let p' be the position in S \ St that is adjacent to r. Depending 
on whether r — p' = p — Pm-i holds or not, we have two cases to be investigated. 

The first case is when this equation holds. Note that in this case there must exist a position 
Pm+i S St such that pi — r = Pm+i — Pm- This is because a{r) and a{pi) are bound via double 
glue. If the seed of T is in 5'\5't, then we can replace the singly-connected subtree of St rooted at 
pi with the subtree Si of St rooted at Pm+i- It is impossible that the growth of Si was blocked in 
a, since Si is a tree, and this replacement enables it to grow further. Hence, T could self-assemble 
a shape strictly smaller than S. However, this contradicts that T strictly self-assembles S- This 
argument works also when r has a third adjacent point, which is in St, and the seed is in the 
singly-connected subtree rooted at the point. If the seed is on the path p between r and p, then 
T could repeat this path when the growth reaches p, and then continue the self-assembly process 
after the repetition in the same way as done in the expected assembly at p. This growth cannot be 
blocked by any tile on S" \ because if it were, then r would be on a cycle in S, contradicting the 
fact that St is a subtree of S. No tile on St can block it either because St is a tree. Thus, T cannot 
set its seed location on p. The remaining possibility is when the seed is in the singly-connected 
subtree rooted at p (let us denote it by S2)- Then T could grow the sub-assembly of the shape 
S \St Sit p instead of the path reaching to r. This alternative assembly process is not blocked by 
S2- Furthermore, the growth oi S \ St in the expected assembly is not blocked by anything but 
tiles in S \St- Thus, the alternative assembly would be strictly smaller than S. 

Let us consider the other case when the equation r — p' = p — pm-i does not hold. In this case, 
there must exist points p'm+iiQ ^ ^t satisfying r — p' = p — p'^+i, r — q = p — Pm~i, and q 7^ p' . If 
the seed is in the singly-connected subtree rooted at p'^+i, then at p, T could proceed its assembly 
in a manner expected to occur at q because C({q) can attach to a{p). This results in a terminal 
assembly strictly smaller than 5. Otherwise, after reaching p, T could grow the subassembly of 
shape S \ St from p. This contradicts the fact that St is a tree. 

This claim has been verified so that a{r) never appears on St \ {r}- By replacing the glue of 
the side that faces S \St with a null glue, and furthermore putting the seed on r if the seed of T 
is in 5 \ St, then we can construct a new TAS T' that strictly self- assembles St without changing 
any tiles on St \ {r}- Thus any TAS which strictly self-assembles S needs at least C^'^{St) tile 
types to assemble St- □ 

If 5" is a tree, the analogous result of this lemma does not hold. Let us consider a tree 
U = {(0,2),(0, 1),(0,0),(1,0),(2,0),(2,1),(2,2)} and let J = C/\{(0,2)}. Easily we can see 
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that C*=(J) = 6, while C*=(?7) = 5. 

Theorem 4.2. There is a finite shape S Cl? such that C^^iS) < C'^*'=(5). 

Proof. The shape S is shown in Figure 2. In the fohowing, the loop L means the shape which 
consists of Lq, Li, . . . , i/i, L'q, L[, . . . , L'^, the tile between Lf^ and L^, and the tile between Lq and 




Figure 2: A finite shape S for which C*'^(S') < (5"). Nondeterminism is forced to occur at the 
two-color top-middle position of the loop L, since any minimal tile set must reuse the tile types 
from subtrees A and B to create L. 



The height h is left as a variable parameter; increasing h increases the gap between C^^{S) and 
(-jdtc^^-j_ Let us index the tile positions on the pillar A from its bottom as Ai, A2, ■ ■ ■ , A^ and do 
the same for B as Bi, B2, ■ ■ ■ , B^. In a similar manner, the left and right pillars of the loop L are 
indexed as Li, L2, ■ ■ ■ , and L'2, . . . , L'^, respectively. 

First we establish that C^^{S) < 2h + 16 (actually with equality, but we only require and only 
prove an upper bound). If the seed is placed in the bottom row, 12 tile types (including the seed) 
hard-code the gray positions, h + 2 tile types hard-code the subtree A, and h + 2 tile types hard- 
code the subtree B. The tile types at Aq, . . . ,Ah can be reused at Lq, . . . , L^, and the tile types at 
Bq, . . . , Bh can be reused at Lq, . . . , L'^. Note that this TAS is not directed because the top- middle 
position of the loop could receive either a tile from ^ or a tile from B. Furthermore, these must 
be different tile types, because the top-right tile type in A must have a double glue on its west but 
cannot have a double glue on any other side, whereas the top-left tile type of B must have a double 
glue on its east but not on any other side. 

We now show that this nondeterminism is necessary to achieve minimum tile complexity. In 
particular, we will show that C'^*'^(5') > 3h. Let St be the tree which consists of the pillars A, B and 
the three tiles connecting them. Let T be a directed TAS that strictly self-assembles S. Lemma 4.1 
implies that any TAS that strictly self-assembles S needs 0*^(5^) tile types to assemble St, and 
due to Theorem 4.3 in [5], C*'=(S't) = 2h + 5. 

First we consider the case when T places its tiles such that every pair of adjacent tiles on the 
loop L is bound via double glue. Being singly-seeded, either the left or the right pillar of the loop 
L does not contain the seed; assume without loss of generality that the left pillar does not. Since 
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all tiles on the loop are double-bonded, the left pillar can grow upward as Li —)• L2 ^ •••—)■ -L^. 
Note that the tile on the top-middle position of the loop L should be different from the one on the 
end of the pillar A; otherwise the gap between the ends of A and B would be filled with the tile 
on L^. This means that the left pillar cannot reuse the tiles on A, and trivially it cannot reuse the 
ones on B. Thus, in this case, T contains at least 3/i tile types. 

Next we consider the case when some of adjacent tiles on the loop L are not bound via double 
glue. Note that at most 2 such weak bonds can appear on the loop, and furthermore, they must 
be incident on a single tile. Thus, we assume without loss of generality that such a weak bond 
does not occur on the left pillar. Depending on whether the bond between and the tile on the 
top-middle of L is weak or not, there are two subcases to be investigated. If it is weak and the seed 
is not on the right pillar, a similar argument as above enables us to see that any tiles oti A 01 B 
cannot be reused for the right pillar (the right pillar must grow upward because of the weak bond) . 
If the bond is weak and the seed is on L^, then on L[^^, . . . , L^, T cannot put any tile placed on A 
or B. Furthermore, if T reuses some tile from A 01 B and places it on some of L'^, . . . , L'-, then the 
bottom row could extend to the left of the loop into empty space. Finally, we consider the second 
subcase when binds to the tile on the top-middle position via double glue, but L'j does not bind 
to -Z^j+i via double glue for some 1 < j < h. This establishes that the left pillar must be hardcoded 
by h new tile types, whence T contains at least 3h tile types. □ 

4.2 The Minimum Tile Set Problem is Z2-compIete 

The following is the main theorem of Section 4. 
Theorem 4.3. MinTileSet is Y.2- complete. 



Proof. To show that MinTileSet G , define the verification language 



MinTileSetv 



(5, c, T, a) 



S ^TS,c(^l^ ,T = (T, cr, 2) is a TAS with 
|r| < c, a = (cT, a2, as, . . . , Ofc) is a T-assembly 
sequence with Sa^ = 5, and is T-terminal 



Clearly MinTileSetv g P. MinTileSet g because (S*, c) G MinTileSet if and only if there 
exists T = (T, fj, 2) with |r| < c such that for all T-assembly sequences a = (o", Q2, . . . , Ofc) of 
length k =\S\, {S,c,T,a) e MinTileSet^, with \{T) \ and |(a)| bounded by 0(| (5, c) p). 

To show that MinTileSet is Z^-hard, we show that 3VCNF-Unsat MinTileSet, where 
BVCNF-Unsat is the Z2 -complete language [37,40,45] 



3VCNF-UNSAT 



is a true quantified Boolean formula ip = 3x\/y^(j){x,y), 
where cp is an unquantified CNF formula with n + m 
input bits x = xi,. . . ,Xn and y = yi, ■ ■ ■ ,ym 



We follow a similar strategy to the reduction of SSat to MinDirectedTileSet shown in [5]. The 
<^-reduction {(p) i-)- (5, c) works as follows. First, we compute a tree T G TS that "represents" ip 
with subtree gadgets that encode possible variable assignments and their effect on clauses. We then 
process T with the polynomial-time algorithm described in [5] that computes the minimum number 
of tile types needed to strictly self-assemble a tree. Let T = (T, cr, 2) be this minimal TAS that 
strictly self-assembles T, and let c = |T|. We then compute a shape S G J-S such that T C S with 
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Xj X2 X3 jjj middle gadgets top gadgets right satisfied right unsatisfied false true false true 

gadgets gadgets 



Figure 3: The shape S of the reduction (99) H> {S, c) showing 3VCNF-Unsat <^ MinTileSet. In 
this example, the quantified negated CNF formula (/? = 3xiy^(j){x,y) has clauses Ci,C2 and C3, 
3-variables xi,X2, and 2:3, and V- variables yi and 1/2 • The "matrix" of gadgets at the top left has 
a row of gadgets for each clause and a column of gadgets for each variable. The matrix sits atop a 
group of "pillars" that, when tiled by actual tiles, will represent a variable assignment to (j) (along 
with one taller left-boundary pillar to help initiate cooperative binding of gadgets to assemble the 
matrix). The tree T is 5" without the matrix and pillars beneath it. In the zoom-in, the two yellow 
lines above the yellow X represent strength-1 glues that cooperate to place the gray gadget once 
(enough of) the black gadgets to its west and south are in place. The yellow X shows "backward 
growth" of the gray gadget that is blocked before it can grow down far enough to form a new copy 
of the bottom row of S. 



the property that, if is true, then the tile types in T can be modified, solely through changing 
some null glues to be single or double glues, producing a TAS T' = (T',a,2) with \T'\ = \T\ = c 
such that T' strictly self-assembles S, and if (p is false, then no TAS with at most c tile types can 
strictly self-assemble S. The shape S is shown in Figure 3. In Figure 3, the height of pillars is set 
to a number bigger than 20£, where i is the number of variables in p.^ 

Suppose that <j) has k clauses Ci,...,Cfc and i = n + m input variables vi,...,Vi, where 
vi, . . . ,Vn = xi, . . . ,Xn are the 3-variables of and Vn+i ■ ■ ■ ,Vi = yi, . . . ,ym are the V- variables of 
ip. A clause C is satisfied by variable v ii C contains literal v and v is true, or if C contains literal 
-If and V is false. For each 1 < i < k and 1 < j < i, define the following six gadgets: 

1. SSTjj: Ci is satisfied by Vp for some I < p < j, and Vj is true. 

'Actually, it is enough to set the height of pillars to any number bigger than the width of the clause-variable 
matrix. 
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SST UUT UST SSF UUF USF 

Figure 4: Six main varieties of "information-bearing" tree gadgets used in the reduction. The 
position where the gadget is intended to go in the matrix is encoded in binary. Gadgets 

intended for the top row are missing the top "T/F" bumps, and gadgets intended for the right 
column are different on the right depending on whether the clause is satisfied or not, as shown in 
Figure 3. 

2. SSFij-: Ci is satisfied by Vp for some 1 < p < j, and vj is false. 

3. UUTjj: Ci is unsatisfied by Vp for every 1 < p < j, and Vj is true. 

4. UUFjj-: Ci is unsatisfied by Vp for every 1 < p < j, and vj is false. 

5. USTjj: d is unsatisfied by Vp for every 1 < p < j , Ci is satisfied by vj, and vj is true. 

6. USFjj: Ci is unsatisfied by Vp for every 1 < p < j , Ci is satisfied by Vj, and Vj is false. 

Each of these six main varieties of "information-bearing" gadgets is shown in Figure 4. Each gadget 
is designed to minimize the amount of "potential unwanted cooperative strength- 1 binding" when 
they are placed next to each other in the "matrix" of gadgets in the upper left of Figure 3.^*^ Each 
gadget encodes the integers i and j, as well as encoding the information about the clause Ci and 
variable Vj as described above. Some of the "boundary case" gadgets are shaped slightly differently 
than those in Figure 4. li i = k {a "top gadget"), the top of the gadget will not encode information 
about the truth value of the variable Vi. li j = I (a "right gadget"), the right side of the gadget 
will still encode whether the clause is satisfied, but the gadget will have a different shape than for 
1 < J < ^- These special boundary shapes are shown in Figure 3. 

Not all six varieties of gadgets are created for each (i,j); the only gadgets created are those 
that are logically consistent with some variable assignment to (j). The matrix and pillars portion 
of S (i.e., S \ T) depends only on the number of 3-variables, the number of V- variables, and the 
number of clauses. The remainder of the information about ip is encoded in the following choices 
about which gadgets to create in T. In the case of j = 1, the gadgets SSTji and SSFji are not 
created. For any clause Ci in which the literal Vj (resp. -^Vj) does not appear, the gadget USTjj 
(resp. USFjj) is not created. Similarly, for any clause Ci in which no literal Vp (resp. ^Vp) appears 

Strength- 1 glues can only have an effect on growth of gadgets in the matrix when they are on tiles on the gray 
positions r, s, and t in Figure 4, if the tile types used to assemble those gadgets in the matrix are the same as those 
used to assemble the gadgets in T. This is useful in proving the converse direction of the reduction by showing that 
if a tile assembly system with < c tile types strictly self-assembles S, then tp must be true. 
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for any 1 < p < j, the gadget SSTjj (resp. SSFjj) is not created. Finally, for any clause Ci in 
which the literal Vj (resp. ^Vj) does appear, the gadget UUTjj (resp. UUFjj) is not created. 

The tree T is S" without the "matrix" on the top left and the "pillars" beneath it that connect 
it to the bottom row. Let c = C'^'^(T). We assume that the seed is placed on the rightmost position 
of the bottom row, for both the shapes T and S. At the end of the proof we show how to modify 
the shapes to enforce this restriction. The steps needed to complete the proof are divided into 
several lemmas. These lemmas are proven after the current proof. Lemmas 4.4 and 4.7 establish 
each direction of the claim that (p is true <^=^ C*'^(S') < c. Intuitively, since T is a "tree-like" 
subshape of S (despite the leftmost tiles intersecting cycles in 5), any tile system that strictly 
self-assembles S must place tiles in the bottom row that do not appear anywhere else in T. is 
true =^ C^^{S) < c because we can modify the null glues of tiles in the left half of the bottom row 
of T to be double glues matching those tile types from the pillars on the right to grow the pillars 
on the left. In the case of the 3-variables x we choose an assignment by our choice of double glues. 
In the case of V-variables y we have no choice; we must allow both the "false" and "true" pillars 
to grow and nondeterministically compete to assign a bit to each yi. We can then modify null 
glues in the gadgets and pillars to be single glues that propagate information about the neighbors 
of a gadget to allow a new gadget encoding the proper information to be placed in the matrix. 
Therefore the assembly of the matrix "evaluates (j){x, y)" and if it is false, strictly self-assembles 
S. The reverse direction is more tedious to establish. Again, since T is a "tree-like" subshape of 
S, any TAS strictly self-assembling S already uses c tile types just to assemble the T portion of S 
(derived from Lemma 4.5). Therefore to assemble all of S using c tile types requires reusing these 
same tile types. Our gadget design, together with the properties of minimal tile sets for trees, allow 
us to conclude that the only way to tile the matrix is "using the gadgets in the way they were 
intended" , which means the rightmost vertical bar of the matrix cannot form unless at least one 
clause is not satisfied; i.e., is true. 

To handle the placement of the seed, define a £ S to be the rightmost point on the bottom row 
of S. Make two copies of 5, place one directly above the other but without touching, and connect 
the copies by a width-1 "bridge" of length h that connects to each copy of a on a's right side. 
Denote this new shape by S' . It is routine to show using techniques similar to those in the proof of 
Lemma 4.1 that any minimal TAS for S' uses C^^{S) + h tile types, places the seed in the bridge, 
uses h tile types to grow the bridge and uses the tile types of a TAS T that is minimal (subject to 
the restriction that T places the seed at a) for S, to assemble each copy of S. Let c' = c + h. Our 
reduction outputs {S',c'), rather than {S,c). By the arguments above concerning S, we have that 
ip is true C*^(5') < c', whence 3VCNF-Unsat <p MinTileSet. □ 

In the following lemmas, (p denotes an arbitrary (true or false) quantified Boolean formula 
of the form (p = 3xVy-i0(a;, y), where (p is an unquantified CNF formula with n + m input bits 
X 37l , . . . , Xn and y = yi, . . . , y^. T,S G FS{I?) refer to the tree and shape constructed from ip 
as in the proof of Theorem 4.3, and c = C*'^(T). 

Lemma 4.4. If ip is true, then C*'^(«S') < c. 

Proof. Let 7t = (T, cr, 2) be a minimal TAS that strictly self-assembles T with seed placed at 
position a, the rightmost point of the bottom row of T, and let a € .An[7T] be the unique terminal 
producible assembly of Tr, such that Sa = T. Theorem 4.3 of [5] shows that if 7t is a minimal TAS 
for T, then 7t puts the same tile type in two positions pi,P2 G T if and only if the subtrees of T 
rooted at pi and p2 (with the seed location considered the root of T) are isomorphic and "identically 
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entered" (meaning both of them have their parent in the same direction). This theorem is stated 
for directed TAS's but it is easy to show that any minimal TAS for a tree must be directed. Given 
a and t & T, define t to be singular in a if it appears exactly once in a. Thus, all the positions at 
the bottom row of T will receive tile types that are singular in a. 

Since f is true, there is an assignment / to variables xi,X2, ■ ■ ■ ,Xn such that any assignment 
to 2/1) 2/2, ■ ■ ■ 5 2/m makes (j){x, y) false. Since all the tile types in the bottom row are unique, we can 
change the north glues of the tiles at the base of the clause-variable matrix, without ruining the 
rest of the shape. We change the north glues so that the blue pillar grows as the leftmost pillar and 
for each variable Xi we grow the red true/false pillar depending on f{xi) being true or false. For 
the positions corresponding to y variables, we change the north glues so that both true and false 
green pillars can grow. 

We will also change a number of null glues into single glues in the following way: the set of 
labels with strength one will be T, F, Sjj, and Vij, where 1 < i < k and 1 < j < ^. For each 
gadget G, let p be the position of its branch point in T, as shown in Figure 4. Then, we change 
the null glue of the south side of a{s) to a single glue with label T if G is of type SST, UUT, or 
UST; otherwise, we change the south glue to a single glue with label F. Also, we change the north 
glue of a(s) to a single glue with label Sij if G is a gadget for the ith clause and jth variable and 
is of type SST or SSF; otherwise, if G is of type UUT, UUF, UST, or USF, the north glue of a{s) 
will be a single glue with label Vij. Note that, the tile type a{s) is singular in a, due to the fact 
that its rooted subtree contains the encoding about the gadget type, clause number, and variable 
number, and hence, is not isomorphic to any other subtree. 

The north glue of the tile type a(r) will be changed to a strength one glue with label T if G is 
of type SST, UUT, or UST; otherwise, its label will be F. The south glue of the tile type a{t) will 
be changed to a strength one glue with label Sj j+i if G is for the ith clause and jth variable and 
is of type SST, SSF, UST, or USF; otherwise, its label will be Ujj+i. 

Applying the above-mentioned changes will give us a TAS Ts with the same tile complexity as 
7t- In 75, a gadget can grow at the cell of the matrix corresponding to clause i and variable j using 
cooperation of single glues of the bottom and left gadget if and only if its notches match the notches 
of the bottom and left gadget. And, the notches of gadgets are designed in a way that they can be 
put together to assemble S if and only if the truth assignment to x and y variables (presented as 
pillar notches at the first row of the matrix) make (/>(x, y) false. Intuitively, the gadgets grow in the 
matrix so as to "evaluate (j){x, y)" on inputs x and y encoded in the pillars, with the pillars encoding 
y nondeterministically choosing values for each of the yi . If we choose the proper assignment / to 
X, such that, for all assignments to y (corresponding to different terminal assemblies), (p{x,y) is 
false (such an assignment / exists since ip = 3x\/y^(j){x , y) is true), then all of these assemblies will 
have at least one unsatisfied right gadget in the rightmost column of the matrix, and the assembly 
will have shape S. Therefore Ts strictly self-assembles S. □ 

From here until the end of the section, assume that Ts = {Ts,cr, 2) is a TAS that strictly self- 
assembles S with the seed placed at the rightmost position on the bottom row. Also, B represents 
the subshape of S that does not have the black matrix on the left, but has the pillars beneath it, 
and I C 7? denotes the set of k x i positions (where k is the number of clauses in cp and i is the 
number of variables) marked by small circles in Figure 3. A set /' C J is called a staircase if the 
following implication holds: 

[{xi,yi) G /',(x2,2/2) e I,X2< xi, and y2 < yi] =^ (2:2,2/2) G I' ■ 
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Let G* = {^Q(zg G, where each element of ^ is a set of tile positions of a gadget created in the 
proof of Theorem 4.3 (Figure 4) translated so that the branch tile is at the origin (so, Q has at 
most k X I X 6 elements). 

Lemma 4.5. // \Ts\ < c = C*'^(T), then there is a TAS Tr = (Tx,o", 2) that can be obtained from 
Ts = {Ts,a',2) by only changing a number of double glues to null glues (in particular implying that 
\Ty \ = \Ts\), such that Tr strictly self-assembles T. 

Proof. It suffices to show that if a G w4.n[7s'], then for any two adjacent positions pi,p2 €= , a{pi) 
and a{p2) are bound together by a double glue. This will establish that, by adjusting double glues 
on the north of tiles beneath the pillars to be null glues, all of T can grow from the tiles without 
growing any of the pillars or matrix of S* — T. Because each adjacent tile in the row under the 
matrix double bonded, each of these tile types must be singular in a \ T (a restricted to T); 
otherwise, their appearance elsewhere in a would lead to copies of the pillars and matrix growing 
in a second location. 

For the sake of contradiction, let pi £ T and p2 € T he the closest positions to the seed that 
are adjacent to each other but a{pi) and a{p2) do not have double glue between them. Then, 
they are on the bottom row below the clause-variable matrix in S. Thus, there must be a pillar 
growing down from the clause- variable matrix; consider the pillar that grows down in a the earlier 
than the others. This pillar cannot reuse any tile type that is used in positions to the right of pi 
in T; otherwise, an undesirable part of T can grow to the left of the downward pillar. Therefore, 
the number of tile types in Ts is at least C*'^(T) — x + y, where x is the horizontal width of the 
clause-variable matrix and y is the height of the pillars. This is a contradiction to the assumption 
that Ts uses at most C*'^(T) tile types, since we set y > x in our construction. □ 

The following lemma states the "inductive step" of the proof of Lemma 4.7. Namely, if gadgets 
of a minimal tile system for S grow to fill in part of the matrix "in the way we intend" , then the 
only way to extend this growth to fill in an additional gadget is also "in the way we intend." 

In the following lemma, "right branch" and "top branch" refer to the two subtrees of a gadget 
rooted at the branch as shown in Figure 4. 

Lemma 4.6. Suppose Ts has at most c = C*'^(T) tile types. Let a G ^[7s] be a producible assembly 
such that 

1. B Q Sa Q S . (where B is the subshape of S that does not have the black matrix on the left, 
but has the pillars beneath it) 

2. Sa r] I is a staircase of cardinality m < \I\. 

3. All tile types in a{Sa H /) are branch tiles of gadgets. 

4. The right and top branches of tiles in Sa^I are present in a. 

5. If there exists p G I such that p ^ Sa, then Sa ri {p + G*) = . 

Then there exists an assembly j3 € ^[Ts] such that a ^ f5 and requirements (l)-(5) are satisfied 
with "a " replaced by "13 " and "m " replaced by "m + 1 
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Proof, a cannot be a terminal assembly, since Ts strictly self-assembles S ^ Sa- 

Since double glues cannot be added to gadget tile types without ruining the tree shape portion 
of S, a must grow by cooperation of single glues. This cooperation can happen only at F — (4,3), 
where 

F = {p £ I \ Sa ■ {Sa n I) U {p} is a staircase}. 

In other words, F — (4, 3) is the set of points marked s in Figure 4, which are adjacent to points 
marked r and t in neighboring gadgets. Let /3' be a minimal assembly producible from a such 
that 5/3' n F is not empty. Let {p} = Sp' H F. All paths from Sa to p in /3' must pass through 
s = p — (4, 3), due to the minimality of /?'. Even if the tile type that goes in s is not taken from 
any gadgets, the tile type that eventually goes to s + (1, 1) must be chosen from a gadget, since it 
should be able to grown the zig-zag shape only by double glues, and the zig-zag shape is used only 
in one of the gadgets. Thus, /3'(p) is a branch tile type. 

Since p € Sp' D I and p ^ Sa I , Spi PI / has cardinality at least m + 1. Let j3 be the minimal 
assembly producible from /?' in which the right and top branches of p are tiled. (3{p) and I3{q) for 
all q in the branches of p must be tile types from the correct gadget to ensure consistency with the 
notches of neighboring gadgets and consistency with the (row, column) identifier notches at the top 
of the right branch. 

Due to the minimality of /3, it satisfies condition 5. □ 
Lemma 4.7. If Ts has at most c tile types, then ip is true. 

Proof. First we show that B is Ts-producible. According to Lemma 4.5, by changing a number 
of double glues in Ts to null glues, we can obtain a TAS 7t that strictly self-assembles T. So, 
T is Ts-producible. Moreover, as can be checked in the proof of Lemma 4.5, the null glues in 7t 
that are double glues in Ts are the north glues of the tile types that appear at the base of pillars 
beneath the gadget matrix. Also, all the pillars must grow from the bottom row to the matrix, and 
not downward, because growing a pillar downward requires adding a double glue to a tile type in 
the matrix area, which will also ruin T. Thus, B, which is T together with the pillars beneath the 
matrix, is T^-producible. This establishes the base case. 

Let f{xi) be true if the red true pillar is used to grow the pillar corresponding to Xj and be 
false if the red false pillar is used. Using Lemma 4.6 for the inductive case, we conclude that there 
is an assembly a such that B L) I Q Sa and valid gadgets are/can be used to fill the matrix part 
of a. By our construction of gadgets, this implies that the truth assignment f to x makes (pix, y) 
false for every value of y. Thus (p is true. □ 

5 Conclusion 

We have investigated the power of nondeterminism for the strict self-assembly of shapes in the 
abstract Tile Assembly Model. We showed that for both the infinite and finite cases, even when 
the shape is required to be strictly self-assembled, nondeterminism can help to assemble the shape, 
by making strict self-assembly possible in the infinite case, and reducing tile complexity in the finite 
case. Furthermore, the problem of finding the minimum tile set to strictly self-assemble a shape is 
strictly harder (in the sense of nondeterministic time complexity) than that of finding the minimum 
directed tile set that does so, unless NP = II2. 

There are some interesting questions that remain open: 
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1. What is the fastest growing function / : N ^ N for which one could prove a statement of the 
form "For infinitely many n £ N, there is a finite shape 5 C such that 0*^(5) < n and 
Qdtc(5) > /(n)"? The proof of Theorem 4.2 of the present paper estabhshes this statement 
for f{n) = 1.4999n. Can /(n) be made, for example, or 2*^? What is an upper bound for 
/ above which such a statement is false? Note that Theorem 3.1 establishes such a statement 
for all functions / : N ^ N if the shape is allowed to be infinite. However, when designing 
complex tile systems, a common challenge is to direct a group of tiles to stop growing, so 
it would be interesting to identify a family of finite shapes with a fast-growing gap between 
the two tile complexity measures. This would imply that sometimes it really helps to employ 
nondeterminism. 

2. We have showed that the optimization problem of finding precisely the smallest number of 
tile types to strictly self-assemble a shape is 5l2-hard. Can it be shown that for some a > 1, 
the solution is 5l2-hard to approximate within multiplicative factor a? This may be related 
to Question 1. 

3. Is there an a > 1 such that it is NP-hard to find an a- approximate solution to the minimum 
directed tile set problem? 

4. Shape-building is one common goal of self-assembly; pattern-painting is another. In partic- 
ular, it is possible to assemble some patterns, such as disconnected sets, if we change the 
definition of what is interpreted as the assembled object. We say that a TAS T = (T, a, 2) 
weakly self- assembles a set S" C if there is a subset B CT (the tile types that are "painted 
black") such that, for all a € ^n[7~], a~^{B) = S. In other words, the set of positions with 
a black tile is guaranteed to be S. In the case B = T, this definition is equivalent to strict 
self-assembly, but for B (^T the shape is allowed to grow outside the desired pattern using tile 
types from T\B to allow "extra computation room" for painting the pattern using tile types 
from B. Such a definition is appropriate for modeling practical goals such as self-assembled 
circuit layouts [21,25,28,31,32,46] or placement of guides for walking molecular robots [26]; 
see [22,23] for more discussion of the theoretical issues of weak self-assembly.^^ It remains 
open to prove or disprove analogs of Theorems 3.1 and 4.2, with "weakly" substituted for 
"strictly". In other words, is it possible to uniquely paint an infinite (resp. finite) pattern 
with a tile system, but every tile system that does so (resp. that does so with no extra tile 
types) is not directed? 

5. It remains open to prove or disprove analogs of Theorems 3.1 and 4.2, with "weakly" substi- 
tuted for "strictly" and with "strict" substituted for "directed". In other words, is it possible 

^^For example, 0*^^(5) « C'**'^(S') — O (log n/ log log n) for 5* an n x fc rectangle with n> k > log n/ log log n, but 
C''^(S) and C'*''^(S) increase steadily towards n as k decreases from log n/ log log ?i to 1; "counting" to the length of 
the rectangle and then stopping becomes more difficult as the rectangle's width decreases. 

^■^In contrast to the case for strict self-assembly, it can be shown that it is uncomputable to determine the minimum 
size tile assembly system that weakly self-assembles a given finite shape. This follows from a "Berry's paradox" 
argument, similar to the one used to show that Kolmogorov complexity is uncomputable, together with the fact that 
arbitrary algorithms may be simulated in a tile assembly system. Briefiy, assuming this quantity is computable, 
define a Turing machine M that on input c G Z"*" enumerates finite sets of points lying entirely on the positive a;-axis 
until a set S{c) is found whose "weak self-assembly tile complexity" exceeds c. Then for each c G define a tile 
system T that simulates M(c) in the second quadrant, using its output to place black tiles precisely on points in 
5(c). Since T requires only log c -|- 0(1) tile types, for sufficiently large c this contradicts the tile complexity of ^(c). 
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to uniquely paint an infinite (resp. finite) pattern witli a tile system, but every tile system 
that does so (resp. that does so with no extra tile types) must self-assemble more than one 
shape on which this pattern is painted? 

6. In [5] the authors show that for the special cases of tree and square shapes, the minimum 
directed tile set problem is in P. For trees, it is straightforward to verify that the minimum 
tile set is always directed, so the answer is the same whether or not we restrict attention 
to directed tile sets. What is the complexity of the minimum tile set problem restricted 
to squares? The polynomial-time algorithm given in [5] crucially depends on the existence 
of a polynomial-time algorithm for the directed shape verification problem of determining 
whether a given tile system strictly self-assembles a given shape and is directed. Removing 
the directed constraint on this shape verification problem, even when restricted to the case of 
squares, makes the problem coNP-complete [6,18,24]. Perhaps this means that the minimum 
tile set problem restricted to squares is hard as well. On the other hand, since this problem 
is sparse, Fortune's Theorem [16] implies that it cannot be coNP-hard (nor NP-hard by 
Mahaney's Theorem [27]) unless P = NP. 
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